Acoustic-to-articulatory inversion using a speaker-normalized HMM-based speech production model
نویسنده
چکیده
Acoustic-to-articulatory inverse mapping is a difficult problem because of its non-linear and oneto-many characteristics. We have previously developed a speech inversion method using a hidden Markov model (HMM)-based speech production model which takes into account the phonemespecific dynamic constraints of articulatory parameters. We found that the constraint significantly decreases the estimation error of articulatory parameters. However, the model was trained for each speaker and articulatory parameters were estimated in a speaker-dependent manner. In this study, we present a speaker-normalized HMM-based speech production model which is constructed from a multispeaker articulatory-acoustic database, and estimate articulatory parameters from multi-speakers’ speech signals using the model. Result shows that the estimation error of articulatory parameters for vowels is about 1.0 mm.
منابع مشابه
Cross-speaker Acoustic-to-Articulatory Inversion using Phone-based Trajectory HMM for Pronunciation Training
The article presents a statistical mapping approach for crossspeaker acoustic-to-articulatory inversion. The goal is to estimate the most likely articulatory trajectories for a reference speaker from the speech audio signal of another speaker. This approach is developed in the framework of our system of visual articulatory feedback developed for computer-assisted pronunciation training applicat...
متن کاملGeneralized variable parameter HMMs based acoustic-to-articulatory inversion
Acoustic-to-articulatory inversion is useful for a range of related research areas including language learning, speech production, speech coding, speech recognition and speech synthesis. HMM-based generative modelling methods and DNNbased approaches have become dominant approaches in recent years. In this paper, a novel acoustic-to-articulatory inversion technique based on generalized variable ...
متن کاملSpeaker verification based on fusion of acoustic and articulatory information
We propose a practical, feature-level fusion approach for speaker verification using information from both acoustic and articulatory signals. We find that concatenating articulation features obtained from actual speech production data with conventional Mel-frequency cepstral coefficients (MFCCs) improves the overall speaker verification performance. However, since access to actual speech produc...
متن کاملBoosting Automatic Speech Recognition through Articulatory Inversion
This paper explores whether articulatory features predicted from speech acoustics through inversion may be used to boost the recognition of context-dependent units when combined with acoustic features. For this purpose, we performed articulatory inversion on a corpus containing acoustic and electromagnetic articulography recordings from a single speaker. We then compared the performance of an H...
متن کاملRecognizing Dysarthric Speech due to Amyotrophic Lateral Sclerosis with Across-Speaker Articulatory Normalization
Recent dysarthric speech recognition studies using mixed data from a collection of neurological diseases suggested articulatory data can help to improve the speech recognition performance. This project was specifically designed for the speakerindependent recognition of dysarthric speech due to amyotrophic lateral sclerosis (ALS) using articulatory data. In this paper, we investigated three acro...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009